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Application of recombinant DNA techniques to biology 
is bringing about a revolution in our understanding of 
living organisms. There is no field of experimental bi- 
ology that is untouched by the power we now have to 
isolate, analyze, and manipulate genes. When the Erst 
edition of Recombinant DNA was published in 1983, re- 
combinant DNA techniques were already being used 
extensivelv for the analysis of viral and bacterial genetics, 
but dissection of eukaryotic genes was only just begin- 
ning- There were hints of what was to come. The concept 
of the gene as a continuous stretch of DNA had been 
shattered with the discovery of introns, but alternative 
splicing and genes- within- genes were yet to be revealed. 
Identification of cellular oncogenes seemed to promise 
an understanding of cancer, but the mechanisms of their 
action and the existence of rumor suppressor genes™ 
were still subjects for speculation,. A handful of genetic 
diseases were being analyzed at the molecular level, but 
the isolation of the disease genes and the development 
of gene therapy were yet to come. 

Our aim in writing the second edition ol Recombinant 
DNA is to show how recombinant DNA techniques have 
led to the explosion in our knowledge of fundamental 
biological processes. As in the first edition, which w v as 
subtitled A Short Course, we provide a concise presentation 
of the methods, underlying concepts, and far-reaching 
applications of recombinant DNA technology. The field 
has grown since the publication of the first edition, and 
so has our book. But even though our previous subtitle 
may be. inappropriate for this enlarged edition, our ap- 
proach to the material has remained true to the spirit of 
the "short course": as before, the uninitiated will rind 
access to the field of recombinant DNA here. 

The book is now divided into six major sections. The 
first five chapters, which are largely unchanged from 
the first edition, provide a historical introduction to the 



early development of recombinant DNA technology, up 
to the point when studies of eukaryotic organisms began 
in earnest. In the next section we describe in detail the 
methods currently used to clone and analyze genes, and 
devote an entire chapter to the polymerase chain re- 
action, which has had an extraordinary impact on re- 
search. The great power of recombinant DNA techniques, 
comes from the ability to explore gene functions by 
manipulating genes and then introducing them back into 
cells. The third section of the book discusses how this 
is done in mammalian cells, yeast, mice, and plants. The 
fourth section describes the progress these manipulations 
have allowed in key areas of biology. Here the range of 
recombinant DNA applications is demonstrated, from 
the analysis of cell cycle control and embryonic devel- 
opment, to the isolation of genes involved with brain 
function. Indeed, these techniques have spawned a whole 
industry — biotechnology. In the fifth section, we describe 
some of its accomplishments, including the development 
of genetically engineered pharmaceutical and agricul- 
tural products, and the studies of the human immuno- 
deficiency virus that are leading the attack on AIDS. The 
differences between the first and second editions are 
perhaps most evident in the final section, where we de- 
scribe the revolution in human molecular genetics' and 
the ways in which recombinant DNA techniques are 
providing new methods for diagnosis and treatment of 
human inherited diseases. 

The topics that are covered and the approach we take 
to describing them make this book suitable for under- 
graduate and graduate students in molecular biology, 
ceil biology, biochemistry, genetics, or biotechnology 
courses; for medical students and physicians; and for 
others who have an interest in recombinant DNA tech- 
niques — for example, forensic scientists, patent attor- 
neys, and science journalists. 
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Textbooks dealing with biochemistry, molecular' cr e - 
nencs, and molecular biology usually present information 
wuhoi.t describing the experiments that were done to 
obtain it. We think that this is a pitv, because desicmincr 
and doing experiments is exciting and fun. As in the first 
edition, we have used real experiments to illustrate im- 
portant biological phenomena, and we have plundered 
our colleagues 1 papers for interesting examples. Fimires 
are used profusely to try to make complex real-life ex- 
periments intelligible, but inevitably we have not been 
able to present all the subtle details. Those who want to 
explore these details will find the expenments in the 
research papers listed at the end of each chapter, and 
the review papers we cite will provide an entry point to 
each topic. 



This book is atypical in another regard. Because we 
do not consider it primarily a textbook for conveying 
undisputed facts about molecular biology, we have been 
able to include exciting research that; is at the cutting 
edge of biology. The interpretation of experimental data 
-often changes with time, so the reader should bear in 
mind that future research might require modification of 
some of the ideas we present. This is all part and parcel 
of doing research, because a science that does not change 
is a dead science. Modern experimental research in bi- 
ology is an ever-changing dynamic enterprise, and we 
hope that Recombinant DNA conveys the excitement of 
the continuing process of discovering how organisms 
work 
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In Vitro 




I # ecombinant DNA technology and DNA sequencing provided the tools 
J\ • .to clone and characterize genes. As we learned in Chapter 8, simple 
Inspection of gene sequences told us- much about genomic organization. Func- 
tional sequences, such as. transcriptional control elements, could often be iden- 
tified by comparing sequences of a number of genes. However, to delve deeply 
Into the structure and function of genes required the ability to change the DNA 
sequence and examine the effect of the change on gene function. For decades 
before the advent of recombinant DNA, this was done by classical genetics, the 
Identification of mutant organisms with new properties- From the genetic prop- 
erties of mutants, information about the structure and function of the unde rlying 
genes could often be Inferred. This approach, however, was limited to organisms 
in which simple generic analysis was possible — bacteria, yeast, fruit Mies. Generic 
analysis of more complex, longer-lived organisms like mice and men was slow 
and difficult 

Recombinant DNA changed all that. The ability to Isolate genes as molecular 
clones, the development of tools to modify gene sequences in rhe test rube, and 
the power to return altered genes to the organism to test their function have 
revolutionized the way generics is done in higher organisms. Because we now 
often work "backwards" from gene sequence to gene function, in contrast to 
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FIGURE 11-1 

General strategy for an in vitro mutagenesis experiment. 
Most procedures for in vitro mutagenesis follow the same 
basic scheme: Plasmid DNA is "mutagenized" in vitro, then 
introduced into £ coil by transformation. Depending on the 
method, mutant clones can be isolated and tested individ-- 
ually, or a library of mutant piasmids can be obtained, 
-which are tested using a genetic screen. 



classical generics, this new approach spawned by re 
conihmaru DNA is called reverse genetics. In this chapter 
we will learn ways to alter the sequence of a cloned 
gene at will and how these methods are used to un- 
derstand the structure and function of genes and gene 
products. 

In Vitro Mutagenesis. Is Used 
to Study Gene Function 

In vitro mutagenesis of cloned genes has become a 
standard tool in the functional analysis of nucleic acids 
and proteins. Most procedures follow the same basic 
scheme (Figure 11-1). Plasmid DNA containing the 
gene of interest is treated m vitro by some mutagenesis 
procedure that alters the DNA either chemically or 
enzymatically. The mutagenized plasmid DNA is in- 
troduced into .£ coli by transformation, and colonies 
containing plasmid molecules are selected by anti- 
biotic resistance. Mutants can be made one at a time, 
or hundreds of different mutants can be created in a 
single mutagenesis experiment Mutant piasmids can 
be isolated from single colonies and tested individ- 
ually Alternatively plasmid DNA can be prepared 
from pooled colonies and the resulting library tested 
en masse to identify mutant piasmids. 

The various approaches to mutagenesis can be 
grouped broadly into random and site-directed meth- 
ods. Random methods put mutations anywhere in a 
plasmid. They are best used to identify the location 
and boundaries of a particular function within a cloned 
DNA fragment and are most readily used for this 
purpose when a simple genetic screen (or selection) 
is available. A genetic screen or selection consists of 
a system to test the function of the DNA of interest 
m cells without having to isolate each plasmid indiv- 
idually. Random mutagenesis is often used as a first 
step, when little is known about the function encoded 
by particular DNA fragment. Analysis of random mu- 
tants generally provides only a simple identification 
of the functional region but does not explain how 
things work on a molecular level The value of such 
a strategy is that it quickly helps to narrow down the 
focus of attention from a large DNA fragment to a 
smaller region that can be studied subsequently in 
greater detail As we will learn, random mutagenesis 
can be accomplished by several different methods, 
such as altering the sequences within restriction en- 
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donuclease sites, inserting- an oligonucleotide linker 
randomly into a plasmid. damaging plasmid DNA m 
vitro with chemicals, or incorporating incorrect nu- 
cleotides during in vitro DNA synthesis. 

Once an important functional domain m a gene has 
been identified by random mutagenesis, site- directed 
methods — putting mutations precisely where they are 
needed — are used to define the role of specific se- 
quences. In addition, directed mutagenesis provides a 
powerful tool For the analysis of protein function, by 
allowing researchers to make specific and subtle 
changes in the structure of the protein. A number of 

strategies have been, developed to construct sue-dP 
____ 

lTbe^ltccornp^ synthetic oligo nucleoacles. 

Wkh an oligonucleotide the desired sequ ence is-sim - 
jH^HbTnl^^ the wild-type framework. Nowadays, 
oIT^ira^ reactions are 

relatively straightforward, and oligonucleotides are 
cheap and easy to obtain. The limitation of site-di- 
rected mutagenesis is that you must already have 
enough information to know what you wish to change- 
There are two standard ways of using oligonucleo- 
tides to construct site-directed mutants: mutagenesis 
by gene synthesis and mutagenesis by enzymatic ex- 
tension of a mutagenic oligonucleotide. By using de- 
generate oligonucleotides (see Chapter 7) a set of 
"random" mutadons at a specific site can also be made. 

Restriction Endomiclease Sites Provide 
the Simplest Access for Mutagenesis 

One of the first experiments done with a cloned DNA 
fragment is to map the positions of restriction endo- 
miclease cleavage sites in the DNA by using a battery 
of different enzymes. Although this information could 
be precisely obtained from the DNA sequence, map- 
ping restriction sites can be accomplished rapidly and 
is often done in conjunction with sequencing. Re- 
striction endo nuclease recognition sites provide the 
simplest way to modify a DNA clone in vitro (Figure 
11-2)- Cleaving plasmid DNA with a restriction en- 
zyme that recognizes only one site produces a linear 
molecule. This serves as an entry point for modifying 
the DNA sequence in the vicinity of the restriction 
site. For example, the enzyme EcoRl recognizes the 
sequence GAATTC and produces ends with 5' ov- 
erhangs. The ends can he made even (blunt) by treating 




FIGURE 11-. 2 ' ' 

Creating a mutation by manipu lation of a restriction site- 
Pi as mid DNA is cleaved with EcoRl restriction endonu- 

clease, which generates a linear fragment with 5' ends char 
have four unpaired nucleotides (so-called sticky ends). - 
Treatment with SI nuclease {left} removes these nucleo- 
tides, and the linear fragment is then treated with DNA U 
£ase. The resulting circular molecule contains a deletion of 
4 bp- Alternatively, addition of DNA polymerase and deoxy- 
rlbonucleotide triphosphates (dNTPs) to the plasmid cleaved 

EcoRl extends the 3 5 ends by DNA synthesis (right). 
After ligation, the resulting molecule contains an insertion 
of 4 bp-" In both cases, the EcoRl site has been destroyed. 
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FIGURE 11-3 

Lmker mse[ tlon mm -genesis to map factional domains of a 
bacterial rransposable element. The scarring plasrrnd contains 
an. intact transposon, an ampicillin-resistarice a ene for selec- 
n0i ;^ n E - coli t and sequences for plasmid replication. The 
DNA is treated with a low concentration, of deoxyribonu- 
clease I in the presence of Mrr T Under these conditions, 
the enzyme makes double-stranded cuts at random positions 
in the plasmid, generating a collection of linear DNA mole- 
cules broken at different positions. Oligonucleotide linkers 
encoding an EcoRl restriction site are added to the ends 
with DNA iigase, the linear molecules are' treated with 
EcoRI endonuclease to create, sticky ends on the linkers, and 
the molecules are ^circularized. The circular molecules are 
transformed into E. colt, and ampicillin-resisrant colonies are 
selected- Plasmid DNA is isolated from individual colonies, 
introduced into another strain of £ colt, and tested for activ- 
ity of the transposon. The positions of the inserted linkers 
are mapped by restriction digestion. Linkers inserted in one 
region (blue) of the plasmid inactivated the transposon. No 
linker insertions in the ampicillim-resistance gene were re- 
covered, because these plasm ids would fail to yield a drug- 
resistant colony m the original selection of transformed 
E. coll 



the cleaved DNA with DNA polymerase in the pres- 
ence deoxyribonucleotide triphosphates. The two 
blunt ends can then be linked together again (ligated) 
by incubating the linear plasmid molecule with DNA 
ligase. A few nanograms of DNA from the in vitro 
ligation reaction is used to transform £ colt, and the 
new modified plasmid is isolated from one of the re- 
sulting colonies. The net result of these manipulations 
is to Insert 4 bp into the plasmid at the EcolU site. 
Alternatively, a small deletion mutation can be made 
by treating the linearized DNA with Si nuclease, 
which specifically digests single-stranded DNA. This 
creates blunt ends by removal of the four nucleotides 
that constitute the 5' overhang generated by EcoRI at 
each end. Subsequent ligation' of the DNA Into a co- 
valently closed circular molecule thus results In the 
deletion of 4 bp from the DNA, In each example, the 
new sequence no longer encodes the EcolU recognition 
site. These types of manipulations, if done to a protein 
coding sequence, would change the translation^ read™ ' 
mg frame, resulting m production of a grossly altered 
protein. The major limitation of using restriction sites 
to make mutations is that there simply may not be 
sites in regions of the gene the experimenter wishes 
to alter. 
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Linker Insertion. Is Used to Map a 
Bacterial Transposon 

We have learned that it is a simple matter to cleave 
a plasmid with a restriction enzyme, blunt the ends 
by treatment with a ON A polymerase, and rejoin them 
by ligation. A variation on this technique is to rejoin 
the ends in the presence of a synthetic oligonucleotide 
'linker/' often one that encodes a restriction site. In - 
sertion of the linker disrupts the gene sequence; the 
position of the inserted linker can he easily mapped 
by cleavage of the plasmid with the restriction enzyme 
that cuts the linker. 

A similar method was used to define the functional 
regions of a bacterial transposable element (a "jumping 
gene/' see Chapter 10), by inserting linkers at many 
alternative positions throughout the element- To place 
linker insertions in the transposon, a plasmid carrying 
a clone of the transposon was treated with a nuclease 
that cleaved the plasmids at random positions (Figure 
11-3). Cleavage conditions were adjusted so that each 
plasmid was cut just once on average. The linearized 
molecules were isolated and ligated into circles again 
in the presence of an 8-bp linker oligonucleotide con™ 
taiaing an EcoRl restriction site, resulting in insertion 
of the linkers into random sites, one in each plasmid. 
The resulting plasmids were transformed into E. colt 
and, using a genetic screen, examined to see if the 
transposon could jump. Insertion of a linker into a 
region of the transposon critical for its function in- 
activates it, presumably by putting a protein-coding 
sequence out of frame. By mapping the positions of 
the inserted linkers by restriction analysis, the loca- 
tions of functional regions of the transposon were 
deduced. 



Construction of Nested Deletions Maps 
the Boundaries of a Transcriptional 
Control Region 

Transcription of the gene encoding the 5S ribosomal 
RNA. molecule is carried out by RNA polymerase III 
(pol lit, see Chapter 8). To identify the sequences 
within the 5S gene required for transcription by pol 
III, a series of d eletion mutations was made and tested 



for their ability to support accurate transcription. Two 
sets of deletions were made. One was made by cutting 
a plasmid carrying a cloned 5S gene at a restriction 
site on the 5 r side of the gene. The linearized plasmid 

was treated with a combination of nucleases that di 

crested away DMA from rhe ends of the molecule 
(Figure 11-4). The amount of DNA removed was 
controlled by varying the time, temperature, or en- 
zyme concentration in the reaction. A second set of 
deletions was generated from plasmid DNA cleave d. 
at a site on the 3' side of the gene. The result was 
two sets of plasmids with progressively larger deletions 
toward the gene from both directions. Testing these 
genes revealed drat only deletions entering a 3 5 -bp. 
region within the transcribed region of the 58 gene 
abolished transcription by pol III. Therefore, this dele- 
tion analysis mapped the transcriptional regulatory 
element to this 35-bp stretch, which has subsequently 
been analyzed in much greater detail by site-directed 
mutagenesis. 

Several different types of enzymes can be used to 
produce deletions. Generally, these enzymes delete 
DNA from both ends of a linearized plasmid molecule. 
Often, however, one end of the molecule contains 
sequences that need to be retained in the plasmid 
because, for example, they are required for plasmid 
replication- In the 5S gene deletion experiment, this 
limitation was accommodated by isolating the deleted 
gene fragments and recloning them into a new vector: 
Alternatively, a strategy can be used that limits dele- 
tion to one end of a linearized plasmid molecule (Fig- 
ure 11-5). This method is widely used to generate 
nested deletions for DNA sequencing (see Chapter 7). 



Linker-Scanning Mutagenesis Permits 
Systematic Analysis of Promoters 

Deletion mutagenesis of the 5S gene mapped the 
boundaries of the transcriptional control region in the 
gene. But not all the nucleotides within the boundaries 
of that 35-bp region are necessarily critical for franc- 
tiom Therefore, methods were needed to change in- 
dividual nucleotides in a target without generating 
oross deletions or other rearrangements. This was ac- 
complished for a viral promoter using an elegant ad- 
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FIGURE i 1-4 

Construction of a nested sec of deletion mutants to map rhe 
transcription control region of a 5S ribosomal RNA gene, 
0) A plssmid clone was linearized with a restrktion°enzyine 
at a position (A) on the 5' side of the gene. The linear ffaer- 
menrs were treated with an exo nuclease, which digests 
DNA from both ends of the molecule. Portions ofthe reac- 
uon were removed at different times to recover populations 
of molecules with progressively larger deletions. Linkers 
were added to the ends, and the molecules were cleaved 
with restriction enzymes specific for'sites B and C to sepa- 
rate the SS gene fragments from the remnants of the vector. 
The fragments were recloned into a new vector, generating 
the sec of righnvard deletion mutants. To create the left- 
ward deletion mutants, this process was repeated after 
cleaving the plasmid at restriction sue B (b) Individual plas- 
- raids were isolated after transformation, their deletion end- 
points determined by DNA sequencing, and their ability to 
support transcription by RNA polymerase III tested with an 
in vitro assay. As can be seen by comparing transcription 
■ activity with the extent of deletion, transcription is inhibited 
when the rightward (5') deletions enter the +40 region and 
when the leftward (3') deletions pass the +80 point. This 
suggests that the transcription control region lies between 
+ 40 and +80. 



Set of 
rightward 
deletion 
mutants 



Vector DNA 



Full-length 

55 ribosomal 
RNA gene 



Set of 
leftward 
deletion 
mutants 




40 
i 



A 



80 



53 RNA transcript 
1 20 1 60 200 

> I ! 



Control sequences (40-80) 
lie within transcribed region 



B 



^^^^ • ,TT' ; " ••• - " 



2^ "^^^ 



Vector DNA 



Vector DNA 



{b} 



+ 



+ 

+ 



In Vitro Mutagenesis 



197 




a — 



<y y Cloned cDNA 



Cleave DNA with P$t\ and BamHl 



S' GATCC F^S^il^^^ 

3 ' Gfeiffl^MiSi 



ctgca 



Exonuclease 1U 



5 GATCCfe§S^^M 



IPlPi CTGCA 



GATCC ^^^^CT 



ct&ca 



.Single- stranded ON A 



Nuclease SI 




Ligafe linker 



Promoter 
\ ntoct 




Linker 




cDNA remaining 
after deletion 



Extent of 

deletion 

controlled 

by time 

of digestion 



apration of deletion mutagenesis cali.ee!. linker scanning 
Using- the methods outlined in. Figure 11-4, cwo sets 
o f p i as mi ds w e r e co ns tru c ted that co n t a i n e d d c I en o ns 
within the promoter.. One set of deletions started from 
a site beyond the 5 f end and proceeded toward the 
crene, leaving the 3' end intact: the other set started 
at a point within the gene and proceeded in die op- 
posite direction, leaving the 5 f end intact. Each dele- 
tion terminated with a 10-bp BamHl linker. The extent 
of the deletion in the DNA was determined for each 
pi as mid by DNA sequencing. Pairs of pi as mid s from 
the two deletion sets with end points 10 hp apart were 
recomhined at their BamHl sites (Figure li-6). The 
effect was to preserve the length and organization of 
the promoter— thought to be important for promoter 
function — but to replace various 10-bp segments of 
wild-type promoter sequence with the sequence in the 
linker. Thus, this experiment created a library of pro- 
moter mutants of similar structure but with nucleotide 
substitutions clustered within 10-bp windows located 
at various sites in the promoter. This collection of 
mutants spanned the length of the promoter. The re- 
sults of this analysis were discussed in Chapter 9. At 
the time 7 this experiment represented the most thor- 
ough analysis of a promoter in a mammalian gene. 



FIGURE 11-5 

Construction of unidirectional deletions using exo nuclease 
III. Exonuclease III attacks preferentially the 3' end of a lin- 
ear DNA molecule with 5' protruding nucleotides. There- 
fore, by cleaving a plasmid molecule at adjacent sites with 
BamHl, which leaves a 5' overhang, and /Vd, which leaves a 
y overhang, only the end generated by BamHl is attacked 
by exonuclease ILL After exonuclease HI treatment, the re 
maining single-stranded tail (along with the overhang at the 
other end) is removed with Si nuclease, which digests only 
single-stranded DNA. An oligonucleotide linker is attached, 
and the fragments are heated to form closed circular mole- 
cules. In the experiment shown here, deletions are being 
used to map the functional domains of a cloned gene in- 
serted in an expression vector This strategy allows dele- 
tions to be made only -in the cloned gene, without damaging 
the promoter sequence. 



Random Nucleotide Substitutions Are 
Obtained by Chemical Modification of 
DNA or by Enzymatic Misincorporation 

While linker scanning allows the creation of nucleo- 
tide substitutions, each mutant generally contains sev - 
eral substitutions, and the positions of the mutations 
depend on the availability of appropriately placed 
deletions. Therefore, several strategies have been de 
ve loped for placing single nucleotide substitutions at 
random positions in a DNA molecule. The simplest 
methods employ chemicals that modify or damage 
DNA Generally, plasmid DNA or DNA fragments 
are treated with chemicals, transformed into E. colu 
and propagated as a library of mutant plasmids. Chem- 
icals most commonly used for in vitro mutagenesis 
include sodium bisulfite, which deaminates cytosine 
residues to uracil, and reagents that damage or remove 
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FIGURE 11-7 

Chemical mutagenesis using sodium, bisulfite. Sodium bisul- 
fite reacts with cytosine bases of single-straaded DNA to 
convert them to uracil, a thymine analog that base-pairs 
with adenine. Single-stranded DNA is treated with sodium 
bisu lfite to modify a small -number of cytosine residues in 
each molecule. An oligonucleotide primer is annealed to the 
DNA and serves as a primer for synthesis by DNA poly- 
merase. When the polymerase encounters a uracil in the 
template strand, it incorporates an adenine into the newly 
synthesized DNA, Since the vector sequences are also dam- 
aged by bisulfite treatment, it is necessary to excise the dou- 
ble-stranded DNA fragment by restriction endonuclease 
cleavage and redone it into an undamaged vector. Follow- 
ing transformation into R colt, a library of mutant plasmids 
can be isolated or individual plasmids can be purified and 
tested. The average number of substitutions in the DNA 
fragmen t can be controlled by altering the conditions of bi- 
sulfite treatment. 



bases, thereby preventing normal Watson-Crick base- 
pairtng (these include hydrazine and formic acid, 
which are used in Gilbert- Max am DNA sequencing, 
Chapter 5), Most often, chemical mutagenesis is per 
formed oa single-straaded DNA and followed by in 
vitro synthesis of the complementary strand using a 
DNA polymerase (Figure 11-7). This synthesis in- 
corporates the mutation into the new strand. In DNA 
treated with bisulfite, an adenine nucleotide is incor- 
porated opposite the uracil; after transformation into 
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FIGURE 11 -8 

Oligonucleoade-directed mutagenesis by enzymatic primer 
extension. A "mutagenic" oligonucleotide encoding the de- 
sired mutation embedded in wild-type flankincr sequence is 
annealed ro a. single-stranded DNA template. The sequence 
ot the oligonueleoude is complementary to the template ex- 
cept tor the nucleotides that define the mutation. Generally, 
the mutagenic oligomer is designed, so that the mismatched' 
nucleotides are positioned m the middle and there are at 
least 8 to 12 nucleotides on either side that base-pair whh 
the template DNA, -The mutagenic oligonucleotide serves as 
a primer for DNA synthesis by DNA polymerase. Once rhe 
entire template has been copied, the end/ of the newly syn- 
thesized strand are covalently linked by DNA hgase Th- 
heteroduplex DNA is transformed into E colt. Theoretically 
both strands can replicate, segregating into separate mutant' 
and wild-type plasmids. In practice, however, most colonies 
contain only one or the other, because enzymes m the cell 
recognize and repair mismatched nucleotides in the hetero- 
duplex before replication. Plasmid DNA is Isolated from the 
resulting colonies and is screened to identify mutants. 
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K coh] the wild-type OG base pair becomes a T-A 
pair. In DNA treated with reagents that eliminate 
bases, any nucleotide can be incorporated opposite die 
' abasic" site, which still retains its deoxyribose back- 
■ bone although it has lost its base. The major Limitation 
of chemical mutagenesis is the specificity of the in- 
dividual reagents: bisulfite mutagenesis, for example, 
changes only cytosines. 

All possible nucleotide substitutions can be gen- 
erated using enzymatic misincorpbration. Here" the 



strategy is to perform in vitro DNA synthesis under 
nomdeal conditions— suboptimal ionic conditions, 
unbalanced concentrations of nucleotide precursors— 
that encourage DNA polymerase occasionally to in- 
corporate the wrong nucleotide during synthesis. For 
example, synthesis is carried out in the presence of 
high concentrations of three of the precursors and a 
very low concentration of the fourth. At positions that 
normally call for the fourth (scarce) nucleotide, one 
of the others is sometimes incorporated instead These 
methods also exploit DNA polymerases that lack a 
proofreading activity -a V to 5 r exonuclease mech- 
anism drat checks each base pair after incorporadon 
and removes nucleotides that are mismatched. Therrnus 
aquaticus (Taq) DNA polymerase, used in the poly- 
merase chain reaction (Chapter 6), lacks such an ac- 
tivity. Though this is a problem when accuracy of 
synthesis Is required, the PGR is a very simple and 
efficient way to introduce random nucleotide substi- 
tudons into a DNA fragment. 

A general problem with random mutagenesis ap- 
proaches is that they often produce mutants with more 
than one substitution. Multiple substitutions m a single 
mutant complicate the interpretation of an experi- 
ment, because it isn't clear which substitution (or 
which combination of substitutions) is responsible for 
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observed changes in rhe pro perries of the mutant. Ex- 
traordinary methods have been used, to circumvent 

this problem- -essentially, significantly reducing the 

extent of mutagenesis and using enrichment protocols 

to find rare mutants-- —but almost ah these procedures 

have been supplanted by new methods that use syn- 
thetic oligonucleotides. 



S y n t h e 1 1 c Oligonucleotides 
Facilitate Mutagenesis 

Most of the methods for mutagenesis we have dis- 
cussed so far have some significant shortcoming— they 
rely on fortuitous access to a sequence via a restriction 
site, forced entry through deletion strategies, or te- 
dious screens to find randomly generated mutations 
in the region of interest. To be most powerful, mu-. 
tagenesis must allow the experimenter to place any 
mod inc a ti on at any position desired In clorieo^TLTNTL 
TrTTs7n"aTl^ecome not o nl^plislmbie, but simple and 
cheap ? with tire advent of synthetic DNA oligonu" 
cleotides. Oligonucleotides provIHe the means to de~- 
sign a particular mutation and then to place it precisely 
where you want it ' ' 

The simplest method for doing oligonucleotide- 
directed mutagenesis is by enzymatic primer extension 
(Figure . 1 1-8). In this method, an oligonucleotide is 
designed that carries the mutation flanked by 10 to 
15 nucleotides of wild- type sequence. This "muta- 
genic' 7 oligonucleotide is hybridized to its comple- 
mentary sequence in single-stranded wild- type DNA 
prepared from a phage or phagemid clone, forming a 
heteroduplex with mismatched nucleotides at the site 
of the mutation. Although the oligonucleotide is not 
perfectly complementary, it will, anneal if the hybrid- 
ization conditions are not very stringent. The oligo- 
nucleotide serves as a primer for in vitro enzymatic 
DNA synthesis by a DNA polymerase that converts 
the single-stranded DNA into double stranded form, 
using the wild- type strand as template. In this way f 
all regions of the plasmid except the region containing 
the mutagenic oligonucleotide will be wild-type in 
sequence. Once the primer has been extended com- 
pletely around the template, the ends of the newly 
synthesized strand are ligated, forming a double- 



stranded circular DNA molecule." This heteroduplex 

DNA - -one strand has the wild- true sequence and the 

other strand .has the mutant sequence is transformed 

into E, colt, where either strand can be replicated. By 
the time a colony grows up, however, it usually con- 
tains only one type of plasmid, wild-type or mutant. 
The types of mutations that can be made by this ap- 
proach range from single nucleotide substitutions to 
cleletions oFinsertions, limited only by the size of the 
oligonucleotide needed. 



Mutant Clones Can Be Identified by 
Hybridization and DNA Sequencing 

Theoretically, half the daughter molecules of a mu- 
tagenesis reaction will be wild -type and half mutant. 
In practice, however, the precentage of mutant plas- 
mids is often much lower. This is due to a variety. of 
technical factors, but the consequence is that methods 
for identifying or enriching mutant clones are vital. 
Mutant molecules can be distinguished from wild-type 
if there is gam or loss of a restriction site. Alternatively, 
the oligonucleotide that was originally used to make 
. the mutation can be used as a hybridization probe to 
distinguish mutant from wild-type molecules (Figure 
11-9). The mutagenic oligonucleotide is radio actively 
labeled with 32 P ATP and hybridized to DNA from 
bacterial colonies on nitrocellulose filters, as described 
in Chapter 7. If the temperature of the hybridization 
is raised in 5 or 1G"C increments, a point can usually 
be reached at which the labeled oligonucleotide will 
hybridize only to the mutant molecules (to which it' 
is perfectly complementary) and not to the wild-type 
molecules, because the hybrid is destabilized, by the 
mismatched nucleotides. Plasmid DNA is isolated 
from an £1 coli colony that strongly hybridizes to the 
probe. Verification that the desired mutation was made 
is accomplished by sequencing the DNA of this pu- 
tative mutant clone. This technique can identify one 
mutant clone among several hundred wild- type clones. 

Several clever methods enrich for mutant clones so 
that the tedious task of screening by hybridization is 
not necessary. In one of diese techniques, the template 
DNA is biologically marked so that it is destroyed 
after transformation into £ coli and the mutant strand 
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FIGURE 1 i 9 

Searching for mutant" piasmids using >h. mu caeenic oligonu- 
cleotide as a probe. Colonies (or plaques) resulting from 
era a storm anon, by murage razed piasmids (see Fiacre 11-R) 
are prepared for colony hybridization en nitrocellulose hb 
ters using methods described m Chapcer 7. The mutagenic 
oligonucleotide is radioactively labeled by phosphor/latin^ 
its y end using J2 P-ATP and polynucleotide kinase. The Li- 
beled oligonucleotide is hybridized to the plasmid DW on 
the nitrocellulose hirers. At low temperature, the oligonu- 
cleotide will hybridize ro both mutant and wild-type DNAs. 
As the temperature is increased, the mismatched oligonu- 
cleotide hybridized to the wild-type plasmid DNA begins to 
dissociate from the wild-type clones. Eventually a tempera- 
ture is reached ar which the mismatched oligomers com- " 
pletely dissociate from the wild-type clones but remain 
hybridized to the mutants. Since the oligonucleotide is ra- 
dioacnvely labeled, the nitrocellulose fiber is exposed to 
x-ray film and mutant clones are identified by the presence 
of a strong signal on the autoradiograph. Mutant plasmid 
DNA is then isolated from the corresponding colony on the 
master plate, using the replica filter as a euide. : 



is preferentially replicated (figure 1 1-10). In a second 
method, the template strand is enzymatically de- 
stroyed before transformation. Both methods can yield 
mutants at a frequency of greater than 50 percent, so 
that plasmtd DNA is simply isolated from three or 
four randomly picked colonies and analyzed by DNA 
sequencing with the expectation that a mutant will be 
found among the DNA selected. 



Oligonucleotide Cassettes Provide a 
Simple Method for Introducing 
Directed Mutations ' ^ 

We learned earlier that restriction enzyme sites pro- 
vide access to a cloned DNA for mutagenesis. If two 
restriction sites are close together, the intervening 
fragment can be removed and replaced with a synthetic 
double-stranded fragment (a cassette) made from two 
complementary single-stranded oligonucleotides car- 
rying any desired sequence. Often, however, conve- 
nient restriction sites are not available; fortunately, it 
is a simple matter to create them using the oligonu- 
cleotide-directed mutagenesis procedures described in 
the previous sections. Once the sites are in place, any 



In Vitro Mutagenesis 



203 



FIGURE I i -10 

Enrichment for oligomicleotide-d i rected mutants by using a 
utacil-contaming template. Single-stranded template DNA is 
prepared in a strain- of E. col: that lacks the enzyme uracil 
degiycQSKb.se (ung"), so that it contains several, uracil resi- 
dues in place of thymines. (Although uracil is not usually 
incorporated into DNA, it is not actually mutagenic and it 
does form a base-pair with adenine.) The mutagenic oligo- 
nucleotide is annealed and primes the synthesis of a strand 
that extends around the template in a reaction using the 
four standard dNTPs (as in Figure 11-8). Following ligation, 
the heteroduplex DNA molecules are introduced into an 
ung + strain of E. coll Once in the cell, the wild- type (tem- 
plate) strand is attacked by uracil deglycosidase, which 
causes breaks m the DNA strand, and the DNA strand is 
degraded before it can be replicated. Since the strand con™ 
taming the mutagenic oligonucleotide does not contain ura- 
cil, it is not attacked and is replicated normally. When this 
procedure is used, 50 percent or more of colonies contain 
mutant plasmids. 
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number of new mutants can be made by inserting 
synthetic fragments into the plasmid (Figure 11-11), 
just as different cassettes can be inserted into a tape 
player. 

This method of cassette mutagenesis was the basis 
for an elegant experiment that verified a structural 
model for DNA recognition by phage repressors. The 
repressors of the A-like phages 434 and P22 contain 
a helix- turn-helix structure (see Chapter 9) that rec- 
ognizes the operator DNA in the phage genome. It 
was hypothesized that amino acid side chains on one 
face of an a helix in the repressor protein make se- 
quence-specific contacts with operator DNA To test 
this hypothesis, a helix swap was performed (Figure 
11-12), Oligonucleotides were synthesized that en- 
coded the amino acids of the helix in the 4 34 represso r, 
with the five positions thought to contact DNA 
changed to those found in the P22 repressor. This 
synthetic fragment was swapped for the natural frag- 
ment in the 434 gene. The resulting hybrid protein 
gained the recognition specifi city of the P22 repressor, 
demonstrating that this helix indeed contacts the 
DNA 

Cassette mutagenesis with degenerate oligonucleo- 
tides can be used to create a large collection of random 
mutations in a single experiment. This method was 
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FIGURE 11-11 

Mutagenesis by casserre replacement. Plssmid DNA. is 
cleaved with resciicnon enzymes EcoRl and itfwdiri, which 
cur at sues chat flank the sequence to be mutated. The small 
cleaved DNA fragment contamina a portion of the wild- 
type sequence is removed, and a DNA fragment (cassette)' 
containing the desired mutation Is ligated into the plasmid. 
■Tins murant DNA fragment is composed of two comple- 
mentary "synthetic oligonucleotides chat have EcoRl and " " 
Htndlll sticky ends when annealed. Because there is no het- 

erodupiex intermediate- -the mutant cassette Is simply 

swapped, for the wild type fragment— the recombinant plas- 
mids are all mutants. A mutant cassette can be composed of 
degenerate oligonucleotides (see Chapter 7), resulting m a 
library of mutant plasmids containing different sequences. 



colonies contain mutant plasm id 



GRE function precisely,, single point mutations 
throughout the 30-bp region were generated and 
tested in cells for mducibiilty by glucocorticoid hor- 
mone. Two complementary oligonucleotides were 
synthesized that carried the 30-bp GRE, but synthesis 
was performed under conditions in which incorrect 
nucleotides were incorporated at a low frequency (Fig- 
ure 11-13). These "doped" oligonucleotides (that is, 
oligonucleotides produced by doping; see Figure 
11-13) were annealed and inserted as a cassette into 
■a promoter that lacked a GRE. Using this method, 
most single-nucleotide substitutions at the 30 positions 
were obtained. Such a collection of mutants would 
have been unthinkable before oligonucleotides revo- - 
Iutionized m vitro mutagenesis. 



Gene Synthesis Facilitates Production 
of Normal and Mutant Proteins 



used to study the structure of the glucocorticoid re- 
sponse element (GRE), an enhancer-, sequence that 
activates a family of genes in response to certain steroid 
hormones. The element had been mapped by deletion 
mutagenesis to a 30-bp region in a glucocorticoid-' 
regulated gene. To define the sequence required for 



The oligonucleotide-duected mutagenesis methods 
we have described use a single oligonucleotide or a 
parr of complementary oligonucleotides to Insert mu- 
tant sequences into an otherwise natural DNA frag- 
ment: With jhe^Jncreas ing availability .of longer 
oligonuc leotides, it is nj } w ifr^ 
tire gene I rcuni^symti 
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FIGURE 11-12 

The helix swap experiment. Amino acids in the phage 434 repressor protein believed re - 
sponsible for recognition of the 434 operator were changed by cassette mutagenesis (Figure 
11 LI) of 434 DNA to the amino acids believed to perform the same function in an analo- 
gous region, of phage P22 repressor protein, (a) Expression in £. colt of the 434 repressor 
protein (left), with an enlargement of the site believed to bind the 434 operator; (tight) the 
corresponding section of the P22 repressor protein, (b) A cassette was synthesized resembling 
the 434 domain, but with P22-cype substitutions at positions thought to be essential for re- 
cognizing P22 operator DNA. This was Hgated into the digested 434 plasmid, and the re- 
combinant vector was introduced into £ coli to produce the hybrid protein, which then 
recognized P22 operator DNA but not the 434 operator . 
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FIGURE 11-13 

Cassette mutagenesis using doped' oligonucleotides r 0 gen 
are numerous mutants in a single experiment An oh^orra,- 
cieouae cassette eacoding the glucocorticoid response 
element (GRE) was synthesized by a DNA synods ma- 
chine. Synthesis was done under conditions m which each 
■ bottle containing a particular nucleotide precursor was 
"contaminated" (doped) with small amounts of the other 
three precursors. In the example above, the DNA synthes- 
izer was instructed xo make an oligonucleotide with the se 
quence GGTTACAAACT. Thus/when a nucleotide 

precursor is called for — a C for example -the machine 

adds an aliquot of the solution from the C bottle, and a C 
base is coupled to the end of most of the oligonucleotide 
chains. However, because die C bottle contains a small 
■ amount, of A, G and T E an incorrect base is sometimes 
added instead. Since the concentration of C Is roughly 30 
times chat of A, G and T, an Incorrect base will be added t 
about 1 out of 30 molecules. This results in a doped collec- 
tion of oligonucleotides, which actually consists of many 
different sequences, some wild- type and some with subsritu 
turns. -The level of contamination was adjusted to favor syn- 
thesis of oligonucleotides with only one substitution, but* 
because substitutions occur randomly, some molecules m 
the collection had none and others had two or more. Czs~ 
settes were formed by annealing complementary doped oli- 
gonucleotides and hgated into a vector. Plasmid DNA was 
isolated from 546 individual R coli transformers and ana- 
lyzed by sequencing. Of these, 224 were wild- type, 218 con- 
tained one substitution (for the 30 bases, of interest, 74 of 
the 90 possible single substitutions were recovered), and the 
rest contained two or more. 
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synthesis, the ex- 
perimenter has total co ntrol oy erlthel^uence of the 
gene. It can be wild-tvg e_^ wav re „ 

qmred Because most amino acids are encoded by 
multiple triplet codons, genes encoding wild-type pro- 
teins can be constructed nsmg different. codons. Co 
dons can be chosen to place unique restriction sites 
throughout the sequence so that mutant cassettes can 
be czsily swapped m. This was done with the bacterial 
rhodopsin gene. Replacing a fragment of the synthetic 
gene with a new synthetic fragment identified the 
amino acid that is linked to the photon-absorbing 
chromophore that initiates photosynthesis. Other frag- 
ments can be exchanged as cassettes to study other 
important structural features of the proteim 

Codons can also be changed by gene synthesis to 
allow production of proteins at high levels in other 
organisms. Studies of the biochemistry of the Fos pro- . 
tein, encoded by a cellular protooncogene In animal 
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FIGURE 1 1-14 

Gene synthesis by ligation of complementary oligonucleo - 
rides.. To synthesize a gene char encodes a protein of inter- 
est; s set of overlapping complementary oligonucleotides are 
designed char can be combined to form a double-stranded 
DNA molecule that encodes the entire protein. The oligo- 
nucleotides are mixed together, heated at 90 U C for a few 
minutes to denature the strands, and then cooled slowly to 
room temperature. During this period the oligonucleotides 
anneal through complementary base pairs. The oligonucleo- 
tides are designed so that each one anneals ro two adjacent 
oligonucleotides from, the opposite strand, bridging them. 
Generally, oligonucleotides ranging in. length from 40 ro 80 
nucleotides are used in gene synthesis. The annealed oli- 
gonucleotides are covalently linked by DNA ligase, produc- 
ing two contiguous DNA strands. This synthetic gene is 
usually purified from a gel before ligation into a vector. The 
resultant recombinant plasmid is obtained following trans- 
formation into E. coli and is sequenced to check that the 
correct sequence was synthesized. The sequence of the syn- 
thetic gene can be designed to place restriction sites at con- 
venient locations for cassette mutagenesis. 



cells (Chapter 18), have been severely hampered by 
the inability to produce the protein in E. colt. This 
problem was finally solved by synthesizing a portion 
of the fos gene entirely from oligonucleotides, chang- 
ing natural fos codons to the codons used most effi- 
ciently in E. coll Insertion of this synthetic gene into 
'an E. coli expression vector allowed for the first time 
the production of large quantities of active Fos protein. 
The gene was also designed with several unique re- 
striction sites so that efficient cassette mutagenesis can 
now be coupled to the biochemical assays for Fos 
function. 



The PGR Caa Be Used to Construct 
Genes Encoding Chimeric Proteins 

The ease with which mutations can be made in a 
protein coding sequence has revolutionized the study 
of protein function. A functional domain can be iden- 
tified by making a series of mutant pro te his, then 
testing which substitutions cause a change in function. 
However, it is not often easy to decide where to make 
a mutation. In the example of the helix swap exper- 
iment (Figure 11-12), the domain that bound DNA 
had been, previously identified And the design of the 
experiment was guided by having a model for the 
three-dimensional structure of the repressor protein. 



DNA sequence or 
ammo acid sequence 



c 



S3 



Complementary 

synthetic 

oligonucleotides 



Heat at 90° C 




Slowly cool to room temperature 




"- / A- 
Intermediates 



DNA iigase 



ZJ Isolate double-stranded 
iiiCi '1 DNA m ol ec u I e 




DNA ligase 



Expression vector 



Transcriptional 
terminator 




Unique restriction 
sites for cassette 
mutagenesis 



Express recombinant 
protein in E. coli 



However, for most proteins, little structural infor- 
mation is available. Identifying a functional domain — 
for example,, a region of the protein that may interact 
with another protein — is difficult to do by inspecting 
the primary amino acid sequence. A simple strategy 
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FIGURE 11-15 

. Construction of a chimeric antibody heavy-chain -encoding gene by "sticky feet-directed" 
mutagenesis Antibodies containing a yZb heavy chain are known to participate m comple- 

Senn Jrht co ™™»S V 1 chains do not In order to 

identify which domain of the y2b heavy chain is responsible for this property, an antibody 

heavv n cha g in f^T T * ^ T ° COnSttUtt 3 S Cne en «*W ^ chimeric 

obr/d T' k T P , ^ agment C "'-" (i '". t ' che C « 2 domain from a yl heavy chain was "re- 
placed with the homologous segment from a ylb gene. Since there were no convenient re- 
daction sues at the ends of the C H ? segments, the 400-nucleotide-long y2b- DNA was 
prepared . by PCR. The PGR primers were complementary to the ends of the yZb DNA but 
■contained -additional nucleotides (the sticky feet) that were complementary to yl DNA at the 
boundaries of the y l C H 2 domain The strands of the PCR-generated fragment were sept 

T^Ll r mg ' StTar fy 3S used 35 ^ Punier in a mutagenesis experiment usin. 

a uiaal-contammg single-stranded yl DNA template by the method shown in Ficnire 11-10 

n ^r?, l| ng C f menC hea 7'f h T gene w «-Coexpressed with a light chain gene m mam-' 
m.han cells to form an annboay that now activated complement. Since only the C H 2 domain 

are j roni the y2b heavy-cham, this result demonstrated that the ,2b C„2 domain contains 
cfu information necessary to activate cornplement-dependenr cell lysis. Sticky feet-directed 
mutagenesis provided a simple means for constructing this complicated gene! 
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that helps to narrow down important ammo acids in 
a. protein is the analysis of chimeras between, related 
proteins. We have previously discussed the use of 
computet programs to identify related proteins by 
comparison of their ammo acid sequences (Chapter 
8). Chimeric proteins are constructed, by replacing a 
segment of one protein with the homologous segment 
from another protein. Although the two proteins have 
functional differences, their sequence similarity often 
indicates that they share a common overall structure. 
A striking example of this was in the anal ysis of human 
growth hormone (hGH).. A series of chimeric proteins 
were made in which most of the amino acids were 
derived from hGH but which contained segments from 
related hormones, such as human prolactin. Using this 
strategy, regions of hGH that interact with the hGH 
receptor were identified. In Chapter 17, we will see 
how functional regions of a receptor which spans the 
membrane seven times were identified by the study 
of chimeras- 

The 434/P22 repressor (Figure 1112) and hGH 
chimeras were constructed by ligation of short oli- 
gonucleotide cassettes into the coding sequence. A 
different strategy (Figure IF 15) was used to prepare 
a chimeric antibody in which a 4G0-bp segment from 
a yl heavy-chain gene was replaced by the homolo- 
gous segment from a ylh gene, A 400-bp DNA frag- 
ment was generated by PGR that encoded the new 
sequence to be inserted and two 30-base "sticky feet" 
on each end. The double -stranded PGR fragment was 



heated to denature the two strands, and then, one of 
the single -stranded molecules was utilized in a unmet ■ 
extension experiment (as in Figure 11-8). Had the 
gene synthesis method, been employed, construction 
of the chimeric gene would have required twenty 40- 
nudeotidedong oligomers- Instead, the sticky feet, 
method used only two oligonucleotide primers for 
PGR. 

Mutagenesis Is the Gateway to Gene 
Function and Protein Engineering 

It would be difficult to overestimate the importance 
of in vitro mutagenesis techniques to biology and bio- 
technology. The harnessing of enzymes that operate 
on DNA and the refinement of oligonucleotide syn- 
thesis have made changing gene sequences an almost 
trivial task. And the ability to operate on DNA lets, 
us also change the structure of the products of genes — 
RNA and, most importantly, proteins. Thus, the im- 
pact of this technology is twofold. It has revolutionized 
how research is done in molecular biology by creating 
the entirely new concept of £t reverse, genetics" — ■ 
changing gene sequence first, then examining gene 
function. And it opens the door to sophisticated protein 
engineering (see Chapter 23) 7 the ability to make 
changes in natural gene products that make them do 
their jobs better. The impact of protein engineering 
on medicine and industry will be substantial 



Reading List 

General 

Wu, R., L. Grossman, and KL. Moldave, eds. Methods in En- 

'zyrnofagy, vol 100: Recombinant DNA, Part B. Academic, 

New York, 1983. 
Gait, M. Oligonucleotide Synthesis: A Practical Approach. IRL, 

Oxford, Eng n 1984. 
Wu, R., and L. Grossman, eds. Methods in Enzynwlogy, vol 

154: Recombinant DNA, Part E. Academic, New York, 1987. 



Original Research Papers 

STRATEGIES FOR IN VITRO MUTAGENESIS 

Botstein, D. 7 and D.-S horde. "Strategies and applications of 
in vitro mutagenesis." Sciertce, 229: 1193 — 1201 (1985). 
[Review] 

Smith, M. "Ia vitro mutagenesis." Ann. Rev. Gen,, 19: 423 — 
462 (1985). [Review] 




C I • AFTER 11 



Zoiier, M.J. "New molecular biology methods for protein 
engineering." Cure. 0pm. Struct Biol., 1: 605-610 (1991), 
■ i [Review] 

INSERTION MUTAGENESIS 

Heffron, F., M_ So, and B J. McCarthy. "In vitro mutagenesis 
of a circular DNA molecule using synthetic restriction 
sires." Proc Nad Acad. Sa. USA, 75: 6012-601.6 (1978). 

Abraham, J.J". Feldman, K. A. NasmythJ. N. Strathern, A, 
J. S. KlarJ. R. Broach, and J. B. Hicks. "Sites required 
for position-effect regulation of mating type informa- 
tion in- yeast." Cold Spring Harbor Symp. Quant Biol, 47: 
989-998 (1983). 
Rees Jones, R. W, and S. P. GofE "Insertional mutagenesis 
of the Abelson murine leukemia vims genome: iden- 
tification of mutants with altered kinase activity and 
defective transformation ability." J. Virol, 62: 978- QS6 
(1988). 



DELETION MUTAGENESIS 

Lai, C. J., and D. Nathans. "Deletion mutants of SV40 
generated by enzymatic excision of DNA segments 
from the viral genome;'/. Mol Biol, 89: 1 79-193 (1974). 
Mertz, J, E, J. Carbon, M. Herzberg, R. W. Davis, and P. 
Berg. "Isolation and characterization of individual 
clones of simian virus 40 mutants containing deletions, 
duplications and insertions in their DNA." Cold Spring 
Harbor Symp, Quant. Biol, 39: 69-84 (1974). 
Sakonju, S., D. Bogenhagen, and D. D. Brown. "A control 
region in the center of the 5S gene directs specific 
initiation of transcription, I: the 5' border of die region." 
Cell, 19: 13-26 (1980). 
McKnight, S. L., and R. Kingsbury. "Transcriptional control 
signals of a eukaryotic protein -coding gene." Science 
217:316-324 (1982). 
Struhl, K. "The yeast his3 promoter contains at least two 
distinct elements." Proc, Natl Acad. Set. USA, 79: 7385- 
7389 (1982). 



CHEMICAL MUTAGENESIS 

Chu, C. T, D. S. Parris, R. A. F. Dixon, F. E. Farber, and 
P. A. SchafFer.-"Hydroxylamine mutagenesis of HSV 
DNA and DNA fragments: introduction of mutations 
into selected regions of the viral genome." Virol, 98: 
168-181 (1979). 

Shortle, D., and D. Nathans. "Regulatory mutants of simian 
virus 40: constructed mutants with, base substitutions 
at the origin of viral DNA replication." J. Mol Biol 
131: 801-817 (1979). 



Werner, H. ; and H. Schaller. ^Segment-specific mutagenesis: 
extensive mutagenesis of a lac promoter/operator ele- 
ment." Proc. Natl Acad Set. USA. 79: 1408—1412 (198?) 

Myers, R... M, L S. Lerman, and T. Maniatis. "A general 
method for saturation mutagenesis of cloned DNA frag- 
ments." Science, 229: 242-247 (198 5) 

ENZYMATIC MISINCORPORATTON MUTAGENESIS 

Shortle, D. P. Grisafi, SJ. Renkovic, and D. Botsteim "Gap" 
misrepair mutagenesis: efficient site-directed introduc- 
tion of transition, transversion, and frarneshift muta- 
tions in vitro." Proc. Nad Acad Set USA 79- 1588-1592 
(1982). 

Zakour, R. A., and L. A. Loeb. "Site-specific mutagenesis 
by error-directed DNA synthesis/ 5 Nature 295" 708- 
710 (1982). 

Abarzua, P. ; and K.J xVIarians. "Enzymatic techniques for 
the isolation of random single -base substitutions in vi- 
tro at high frequency." Proc. Natl Acad. Set. USA 81 - 
2030-2034 (1984). 

Leung, D, W., E. Chen, and D. V. Goeddel. "A method for 
random mutagenesis of a defined DNA segment using 
a modified polymerase chain reaction." Technique t 
11-15 (1989) 

OLIGONUCLEOTIDE DIRECTED MUTAGENESIS 

Hutchinson, C. A., Phillips, M. H. Edgell, S. Giliam, P. 
Jahnke, and M. Smith. "Mutagenesis at a specific po- 
sition in a DNA sequence;' J. Biol Chew., 253: 6551- 
6560 (1978). 

Wallace, R. B., M. Schold, M. J. Johnson, P. Dembek, and 
K. Itakura. "Oligonucleotide-directed mutagenesis of 
the human /?-giobin gene; a general method for pro- 
ducing specific point mutations in cloned DNA " Nuc. 
Acids Res., 9: 3642-3656 (1981). 

Zoller, M. J., and M, Smith. "Oligonucleotide-directed mu- 
tagenesis using Ml3-derived vectors: an efficient and 
general procedure for production of point mutations 
in any fragment of DNA." Nuc. Acids Res., 10: 6487- 
6500 (1982). 

Kramer, W, V. Drutsa, H. WJansen, M. Pflugfelder, and 
H.-J. Fritz. "The gapped duplex DNA approach to 
oligonucleotide-directed mutation construction. 1 ' Nuc. 
Acids Res., 12: 9441-9456 (1984J 

Kunkel, T. A. "Rapid and efficient site-specific mutagenesis 
without phenotypic selection" Proc. Natl Acad Sci USA, 
82: 477-492 (1985). 

Taylor J. W.J. Ott, and F. Eckstein. "The rapid generation 
of oligonucleotide-directed mutations at high fre- 
quency using phosphorothioate-modified DNA." Nuc 
Acids Res, 13: 8765-8785 (1985). 



la 



tro 



Mutagenesis 



211 



CASS r - if E M U T AGENESIS 

Lo, K.-M., S. S. Jones, N. R- Hackett, and H. CI Khorana.. 

"Specific amino acid substitutions in bacterioopsin: re- 
placement of a restriction fragment, in the structural 
eene bv synthetic DNA fragments containing altered 
codons" Proc. Nad Acad. Set.. USA, 8 1 : 2 23 5 - 7289 (1984). 

Wells, J. A., M. Vasser, and D. B. Powers, "Cassette mu- 
tagenesis: an efficient method for ^feneration of multiple 
mutations at defined sites." Gene, 34: 315 -32 3 { 1 98^1 

Wharton, R-, and M Ptashne. "Changing the. binding spec- 
ificity of a repressor by redesigning an alpha-helix." 
Nature, 316:601-605 (1 98 5 ). 

Reidhaar-Olson J. F , and R. T. Sauer. "Combinatorial cas- 
sette mutagenesis as a probe of the informational con™ 
tent of protein sequences/ 1 Science, 241: 53 —57 (1988)- 

DOPED OLIGONUCLEOTIDE MUTAGENESIS 

McNeil J. and M. Smith. " Saccharomyces cerevisiae CYCl 
mRNA 5'-end positioning: analysis by in vitro muta- 
genesis, using synthetic duplexes with random mis- 
match base pair." MoL Cell. Biol, 5: 3545-3551 (1985). 

Hutchison, C. A., S. K. Nordeem K. Vogt ? and M H. Edgell. 
"A complete library of point substitution mutations in 
the glucocorticoid response element of mouse mam- 
mary tumor virus." Proc. NatL Acad. Set. USA, 83: 710- 
714 (1986). 

GENE SYNTHESIS . 

Ferreti, L., S. S. Kamik, H. G. Khorana, N. Nassal, and D 
D. Oprian. "Total synthesis of a gene for bovine rho- 
dopsim" Proc. Nad Acad. ScL USA, 83: 599-603 (1.986). 



Carurhers, M. H., A. D. Barone, S. L. Beaucage, D. R. Dodds, 
£. F. Fisher, L. J. McBride, M.. Matceucci, Z. Stabinsky, 
andJ.-Y. Tang. "Chemical synthesis of deoxyoligon- 
ucleotides by the ' phosphoramidite method:' Meth. Rn- 
■zymoL 154: 287-313 (1987). 

Abate, C. T D. Luk, R. Getz, FJ. Rauscher, III, and T. Outran. 
"Expression and purification of the leucine zipper and 
DNA -binding domains of Fos and Tun: both Fos and 
Jun contact DNA directly." Proc. NatL Acad. Set. USA, 
87: 1.032, 1036 (1990).. 

CONSTRUCTING CHIMERIC GENES BY PGR 

Higuchi, R., B. Knimmel, and R. K. Saiki. "A general method 
of in vitro preparation and specific mutagenesis of 
DNA fragments: study of protein and DNA interac- 
tions/ 5 Nuc. Acids Res, 16: 7351-7367 (1988). 

Ciacksom T, and G. Winter. "'Sticky feet'-directed mu- 
tagenesis and its application to swapping antibody do- 
mains; 7 Nuc. Adds Res., 17: 10163-10170 (1989). 

PROTEIN STRUCTURE AND FUNCTION 

Winter, G, A. R. Fersht, A. J. Wilkinson, M. Zolier, and 
M Smith. "Redesigning enzyme structure by site-di- 
rected mutagenesis;' Nature, 299: 756-758 (1982). 

Cunningham, B. C, P. Jtmrani, P. Ng, and J. A. Wells. 
"Receptor and antibody epitopes in human growth hor- 
mone identified by hornolog-scanning mutagenesis." 
Science, 243: 1330-1336 (1989). 

Gibhs, C. S., and M. J. Zoller. "Rational scanning mutagene- 
sis of a protein kinase identifies functional regions in- 
volved in catalytic and substrate interactions." J- BtoL 
Cbem; t 266: 8923 -8931 (1991)- 




3in.ciijtt 





stry 



soon as the first successful cloning experiments were reported in 1973, 
applications for this powerful technology quickly followed. The signifi- 
cance of being able to produce large quantities of human proteins that were 
normally available in exceedingly small amounts, if at all, was not lost on 
scientists, physicians, and businessmen alike. In 1976 biotechnology became a 
reality as the methodologies for DNA cloning, oligonucleotide synthesis, and 
gene expression converged in a single experiment, in which a human protein 
was expressed from recombinant DNA for the first time. The protein was 
somatostatin, a 14 amino acid peptide neurotransmitter. The gene encoding 
somatostatin was not the natural gene but was synthesized. chemically and cloned 
into a plasmid vector for expression in E. coll Soon after followed the successful 
expression of human insulin for the treatment of diabetes, the first commercial 
product of the biotechnology industry. Instead of insulin extracted from the 
pancreases of pigs and cows, diabetics could now receive insulin identical to 
that normally produced by humans. 

The ability to achieve such feats relied on the successes in all areas of molecular 
biology, including oligonucleotide synthesis,' isolation of enzymes that cleave 
and join DNA, characterization of bacterial plasmids, and an understanding of 
gene expression. These methods have, of course, revolutionized research m 
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biology and medicine, but what is equally irruxwmif 
cney .nave spawned an entirely new industry, one de- 
vote 4 t0 the cloning and production of proteins of 
importance to both medicine and industry. Today, 
proteins are produced through recombinant DNA 
technology for treatment of numerous diseases--- can- 
cer, allergies, autoimmune disease, neurological dis- 
orders, heart attacks, blood disorders, infections, 
wounds, and genetic diseases— as well as for more 
prosaic tasks, such as use m laundry detergents and 
food production. In addition, entirely new approaches 
to drug design have emerged from recombinant DNA 
technology; as scientists have gained the ability to 
tinker with natural proteins to improve their function 
and to change them m subtle and useful ways. 



Expression Systems Are Developed to 
Produce Recombinant Proteins 

Cloning the gene or c DNA encoding a particular pro- 
tein is only the first of many steps needed to produce 
a recombinant protein for medical or industrial use. 
The next step is to put the gene into a host cell for 
production. The development of expression systems 
has been an important research area in both industrial" 
and academic laboratories. The most popular expres- 
sion systems are the bacteria £ colt and Bacillus sukilis, 
yeast, and cultured insect and mammalian ceils. We 
have learned in earlier chapters about the development 
of vectors and DNA transformation methods for these 
organisms. Here we will discuss the issues that are 
important for protein production. The choice of which 
cell is used depends on the project goals and on the 
properties of the protein to be produced. 

Bacterial cells offer simplicity, short generation 
times, and large yields of product with low costs. And, 
especially with B subtilis, the cells can be induced to 
secrete the product into the culture medium, thus 
greatly simplifying the task of purification. But expres- 
sion in prokaryotic cells has several drawbacks. Al- 
though some proteins are expressed to"" high levels 
(greater than .10 percent of the mass of all "bacterial 
proteins), they often fail to fold properly and hence 
form insoluble inclusion bodies. Protein extracted from 
these inclusion bodies is often biologically inactive. 



Small proteins can sometimes be refolded into their 
active forms, but larger protcm.s usually cannot. A 
second problem is that foreign proteins are sometimes 
toxic co bacteria, so cell cultures producing the protein 
cannot be grown to high densities. This problem can 
often be circumvented by using an inducible promoter 
that is aimed on to begin transcription of the gene 
for the foreign protein only after the culture has been 
grown. Third, bacterial cells lack. - enzymes that are 
present m eukaryotic cells and add posttranslational 
modifications, such as phosphates and sugars, to pro- 
teins. These modifications are often required for 
proper functioning of proteins. Researchers are ad- 
dressing this problem by purifying the eukaryotic en- 
zymes that carry out these modifications and using 
these enzymes _to add the needed modifications to 
bacterially exptessed proteins. 

Yeast has been used for centuries by brewers and 
bakers, and now it toils for biotechnologists as well. 
As discussed in Chapter 13, yeast is a simple eukaryote 
that resembles mammalian cells in many ways but can 
be grown as quickly and cheaply as bacteria can/ Yeast 
perform many of the posttranslational modifications 
found on human proteins and can be induced to secrete 
certain proteins into the growth medium for harvest- 



ing. 
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isaavantage oi yeast is the presence of active 
proteases that degrade foreign proteins, thereby re- 
ducing the yield of product. Researchers are dealing 
with rhis problem, however, by constructing yeast 
strains in which the protease genes have been deleted 
Expression of heterologous proteins in insect cells 
by baculovirus vectors (as previously described in Fig- 
ure 12-12) is a relatively new approach. The main 
advantages are high-level expression, correct folding, 
and posttranslational modifications similar to those in 
mammalian cells. A vaccine for the AIDS virus has 
been prepared by producing one of the HIV glyco- 
proteins with this system. Although the cost of cuD 
turmg insect cells is currently more than that for 
culturmg bacteria and yeast, it is less than that for 
culturing mammalian cells. 

Despite the significant advantages .of producing hu- 
man proteins in heterologous host cells, in some cases 
the best place to produce a mammalian protein is in' 
mammalian cells. Great improvements have been 
made to promoters, vectors, transformation protocols, 
and host cell systems. Transient expression in mam- 



jmalian cells (described in Figure 12-4) is often used 
for checking; the function of a newlv cloned, eerie and 
as a quick method for assessing the function of en- 
gineered proteins. The extracellular domains oi cell- 
surface receptors (Chapter 17) have been, engineered, 
for secretion from, cells by introducing a stop codon 
into the gene before the transmembrane domain se- 
quence. These soluble receptors are valuable reagents 
for studying li^and binding in vitro and for screening 
for receptor agonists or antagonists, and they may 
eventually be used as therapeutics themselves. Al- 
though transient systems yield enough protein, for lab- 
oratory experiments, stably integrated amplified genes 
in mammalian ceils are used for the large-scale pro- 
duction of proteins such as tissue plasminogen acti- 
vator, which we describe later. 



Insulin Is the First Recombinant Drug 
Licensed for Human Use 

The first licensed drug produced through genetic en- 
gineering was human insulin. An important hormone 
that regulates sugar metabolism, insulin is produced 
by a sm all number of ceils in the pancreas and secre ted 
into the bloodstream. An inability to produce insulin 
results in diabetes, but daiiv injections of insulin are 
sufficient to reverse or at least allay the debilitating 
effects of the disease. Prior to production of the re- 
combinant molecule, insulin for treatment of diabetes 
was obtained from the pancreases of pigs and cows. 
Although this insulin is biologically active in humans, 
the amino acid sequences are not identical to that of 
the human molecule. Thus, some patients produced 
antibodies against injected insulin, occasionally re- 
sulting in serious immune reactions. Because recom- 
binant human insulin is identical ■ to the natural 
product, immunogenicity should not be. a problem. 

In mammals, insulin is expressed as a single-chain 
prepro-hormone, which is secreted through the plasma 
membrane. A prepro-hormone contains extra ammo 
acids not present in the mature hormone. Amino- 
termin al amino acids form the pre sequence and target 
the expressed protein for secretion. The pro sequence 
is a stretch of amino acids in the middle of the hormone 
sequence that is important for folding the polypeptide 



chain into the correct, structure. During secretion, 
these extra amino acids are cleaved from the prepro- 
hormone by cellular proteases to release the mature 
insulin molecule, consisting of two short polypeptide 
chains, A and B 5 linked by two disulfide bonds. The 
principal challenge in the production ot recombinant 
insulin was getting insulin assembled into this mature 
form. The initial approach was to construct synthetic 
genes from oligonucleotides that separately encoded 
the A and B chains. These were individually inserted 
into the E. coli gene encoding /J-galactosidase, so the 
bacteria produced large fusion proteins that had the 
insulin sequences tacked onto the end of the /?- 
galactosidase enzyme (Figure 23-1). These large pro- 
teins were purified from bacterial extracts, and the 
insulin chains were released by treatment with cyano - 
gen bromide, a chemical that cleaves peptide bonds 
following methionine residues. Because a methionine 
codon had been inserted at the boundaries between 
jS-calacrosidase and the insulin chains in the fusion 
proteins, cyanogen bromide treatment clipped intact 
insulin chains off the fusion proteins. These were pu- 
rified, mixed, and reconstituted into an active insulin 
molecule. This approach was refined by producing a 
single /i-galactosidase -insulin fusion protein, which 
could be cleaved in. a single step to release mature 
insulin. A similar method is now in use for the com- 
mercial production of recombinant insulin. 



Recombinant Human Growth 
Hormone is Produced in 
Bacteria by Two Methods 

Growth hormone is a 191 amino acid protein that is 
produced in the pituitary gland and regulates growth 
and development. Children born with growth hor- 
mone deficiency — hypopituitary dwarfs- -never 
achieve normal stature. Regular injections of growth 
hormone stimulate the growth of these children so 
that they reach near-normal heights Unlike the sit- 
uation with insulin, animal-derived growth hormones 
are ineffective. Only the -human protein works, and 
for many years it was painstakingly extracted from the 
pituitaries of human cadavers. One unforeseen and 
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unfortunate consequence of growth hormone treat- 
ment, however, was the infection of a number of chil- 
dren with a fatal vims from one of the cadavers. 
Production of recombinant human growth hormone 
(hGH) would clearly provide a safe, reliable, and plen- 
tiful source of this druQ" 

I ne initial production of hGH was achieved by 
constructing a hybrid gene from the natural hGH 
cDNA and synthetic oligonucleotides that encoded 
the amino terminus of the mature form of the protein 
(Figure 23-2a). This coding sequence was ligated into 



FIG If RE 23-1 

Expression of human insulin la £ coll. Recombinant insulin 
was first made by expressing the A and B chains separately 
then refolding them into a mature! insulin molecule A DNA. 
fragment encoding each insulin chain was made by anneal- 
ing two complementary oligonucleotides that had been, 
chemically synthesized. Each fragment was ligated into a 
bacterial expression vector so that, when translated, the. in- 
sulin chain would be fused to the carboxy -terminus of rhe 
enzyme /?~galactosidase (/?-gal). The expression vectors 

were transformed into £ coli, and the 8 gal insulin fusion 

proteins accumulated inside the bacterial cells. The cells 
were harvested, and each /?-gal-insulin fusion protein was 
purified- The insulin -coding DNA was synthesized so that it 
started with a methionine codon. This setup provided a way 
to cleave off the /f gal part from the insulin polypeptide. 
Treatment of the fusion protein with the chemical cyanogen 
bromide (CNBr) results in cleavage of peptide bonds after 
all methionines. In this way, the natural insulin peptides 
were obtained. Because /?-gal contains other methionine res- 
idues, CNBr treatment cleaved it into many small peptides. 
The insulin chains were not cleaved further because they 
did not contain internal methionines. The A and B chains 
were purified and then mixed together to form active re- 
combinant insulin. 



a plasmjd adjacent to a bacterial promoter. Like insulin, 
hGH is normally produced as a larger precursor pro- 
tein containing an ammo-term mal signal sequence. 
Because the human signal sequence would not be rec - 
ognized by the bacterial secretion machinery, the 5' 
end of the cDNA was reengineered with a synthetic 
DNA sequence enabling the bacteria to produce a 
nearly normal version of the mature human protein. 

The first hGH expression vectors directed the pro- 
duction of the protein inside the cell Purification re- 
quired many steps to separate hGH from the thousands 
of intracellular bacterial proteins. Another way to pro- 
duce the protein in bacteria is to engineer the protein 
so it is secreted. This can be done by linking the coding 
sequence for the desired protein to a signal sequence 
from a secreted bacterial protein, thus forming a pre- 
hormone (Figure 2 3-2b). Human growth hormone is 
produced by the bacteria and then secreted with the 
concomitant removal of the signal peptide by a bac- 
terial protease. Secretion into the periplasm, where 
there are fewer proteins than inside the cell, makes 
purification simpler. The only difference between the 
secreted hGH and rhat produced mtracelluiariy is the 
presence of an ammo-terminal methionine on the in- 
tracellularly expressed molecule. Because the secreted 
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FIGURE 23-2 

Bacterial production of human growth hormone (hGH). (a) An expression vector was con- 
structed for intracellular production of hGH. The coding sequence was constructed by iso- 
lating from die cDNA a DNA fragment that encoded amino acids 24-191 and ligating this 
to a synthetic oligonucleotide fragment that encoded amino acids L— 24. Following introduc- 
tion of the expression vector into bacterial cells, recombinant hGH was produced inside the 
cells- The expressed protein behaved just like natural human growth hormone but coin slued 
the initiator methionine at the amino terminus, (b) A protein can he produced in bacteria 
without this extra methionine by targeting it for secretion.. To do this, a DNA fragment 
encoding a. bacterial signal sequence, which specifies secretion of a bacterial protein, was 
placed in front of the hGH coding sequence. Upon introduction of this vector Into bacteria, 
hGH is produced, and the signal sequence targets the protein for secretion. The protein 
accumulates in the periplasmic space 'between the inner and. outer bacterial membranes and 
can be released by hypotonic disruption of the outer membrane. In contrast to the intracellu- 
lar form of hGH, the protein produced by this procedure does not contain an initiator me- 
thionine, since a periplasmic protease cleaved off the signal sequence. 
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-form lacks this methionine, it: is called met- less hGH. 
Bacteria Hy expressed hGH has been administered to 
thousands of growth hormone-deficient children, who 
have benefited greatly from this recombinant drug. 



A Hepatitis B Virus Vaccine Is - 
Produced, m Yeast by Expression 
of a Viral Surface Antigen 

One of the successes of modern medicine is the de- 
velopment and implementation of vaccines apamstim 
fectious diseases. Prior to the advent of recombinant 
DNA technology; two types of vaccines were used. 
Inactivated vaccines are chem.xca.Uy killed derivatives 
of the actual infectious agent. Attenuated vaccines are 
live viruses or bacteria altered so that they no longer 
multiply in the maculated " organism. Both types of 
vaccines work by presenting surface proteins (anti- 
gens) to B and T lymphocytes, which become primed 
to respond rapidly should the organism actually be- 
come infected, usually destroying the infectious agent 
before any damage is done (Chapter 16). However, 
these types of vaccines are potentially dangerous be- 
cause they can be contaminated with infectious or- 
ganisms. For example, a small number of children each 
year contract polio from their polio vaccinations. Thus, 
one of the most promising applications of recombinant 
DNA technology is the production of sub unit vaccines, 
consisting solely of the surface protein to which the 
immune system responds. With a subunit vaccine, 
there is no risk of infection. 

The first successful subunit vaccine was produced 
for hepatitis B virus (HBV), which infects the liver 
and causes liver damage and, in some cases, cancer. 
The virus particle is coated with a surface anticen 
HBsAg, and infected patients carry large aggregates 
of this protein in their blood. Early experiments sug- 
gested that these aggregates' would make a potent 
vaccine, but how could they be produced in quantities 
sufficient to vaccinate large populations against HBV? " 
With the cloning of the HBV genome, the possibility 
of a subunit vaccine could be explored. Initial attempts 
to produce the HBsAg protein in £ colt failed, so 
researchers turned to yeast. The HBsAg gene .was 
inserted into a high-copy yeast expression vector (Fig- 



ure 15-3) and engineered, in this case, so that it would 
not be secreted (Figure 23-3). Yeast transformed with 
this plasmid produced, large quantities of die viral 

protein (about [ 2 percent of the total yeast protein). 

By growing the yeast in large fer mentors, it was pos- 
sible to produce 50 100 mg of the protein per liter 

of culture. This recombinant protein closely resembled 
the natural viral protein; it even formed aggregates 
with properties similar to those of the immunogenic 
aggregates found in infected' patients. The yeast pro- 
tein is now used commercially to vaccinate people 
against HBV infection. 

Vaccines agamst many human and animal patho- 
gens are currently in various stages of development. 
Recombinant DNA technology has provided a safe 
means to work with and to inoculate children and 
adults with only noninfectious parts of infectious 
agents. In Chapter 25, we will discuss various strategies 
for the development of a vaccine against the AIDS 
vims. 



Complex Human Proteins Are 
Produced by Large-Scale 
Mammalian Cell Culture 

Most of the recombinant proteins we have discussed 
thus far in this chapter are relatively small and simple 
in both structure and function. Other proteins of med- 
ical interest are considerably more complicated in 
structure and function, and biologically active proteins 
have proved difficult to produce in bacteria and yeast 
In these cases, biotechnology companies have resorted 
to using mammalian cells for protein production. 
Mammalian cells are finicky and expensive to grow, 
but they can be counted on to produce correctly mod-, 
ifietl fully active proteins. Thus, much effort in the 
biotechnology industry has been devoted to setting up 
fermentor systems for large-scale culture of mam- 
malian cells. 

The first drug to be produced commercially by 
mammalian cell culture was tissue plasminogen activator 
or tPA, which is administered to heart attack victims. 
Tissue plasminogen activator, is a protease, an enzyme 
that cleaves other proteins. It works by clipping plas- 
minogen, an inactive precursor protein, to form plasmin, 
itself a potent protease that degrades fibrin, the protein 
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FIGURE 23-3 

Production of a subunit vaccine in yeast. Hepatitis B virus 
(HBV) is encoded by a small 3 2 -kb genome that has been 
cloned and sequenced. Both the whole vims and a smaller 
HBsAg (HBV surface antigen) particle ate found in the 
blood of infected patients. To prepare a vaccine against 
HBV, which has been difficult to propagate in culture, the 
HBsAg gene was cloned into a vector for expression in the 
yeast Saccbaromyces cerevzsiae. Transcription occurs from the 
strong promoter from the gene encoding alcohol dehydro - 
genase L A transcription terminator -was' placed downstream. 
The vector contains replication origins and markers for both 
bacteria and yeast Yeast transformed with this plasmid can 
be grown to high ceil densities in fermentors. This process 
results in the accumulation of large amounts of HBsAg 

lOtClUj wlllUl <J.pOIl [JUUULilLluii waS i'..'uud « £ ^ j. ^- £ " 

into particles about 20 nanometers in diameter, resembling 
the particles found in HBV-infected patients: 



char forms blood clots. Rapid administration of a plas- 
minogen activator after a heart attack dissolves the 
life-threatening clots that lead to irreversible damage 
of heart muscle. Tissue plasminogen activator is com - 
mercially produced from a mammalian cell line car- 
rying a stably integrated, highly amplified expression 
vector (Figure 23 4). 

Another protein being produced by mammalian cell 
culture is Factor VIII, a protein required for normal 
clotting of the blood. Genetic defects in Factor VIII 
production, are responsible for hemophilia. For many 
years, hemophiliacs have been treated with Factor VIII 
purified from human blood. With the contamination 
of the human blood supply by the AIDS virus, how- 
ever, thousands of hemophiliacs became infected and 
hundreds died from AIDS. The Factor VIII cDNA 
had already been cloned before scientists found that 
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the blood supply was contaminated with the AIDS 
virus. Recognition of the need for a safer source of 
Factor VRI accelerated efforts already under way to 
produce the protein using recombinant DNA methods. 
Like tPA, Factor VIII. is a large and complex protein 
and can only be efficiently produced m mammalian 
ceil culture. But the availability of recombinant protein 
will spare future generations of hemophiliacs from 
infectious agents that contaminate the blood supply. 
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Monoclonal Antibodies Function 
as "Magic Bullets 77 

We have discussed the use of biotechnology to pro- 
duce novel vaccines that elicit antibody production 
by the body's immune system. As we learned in Chap- 
ter 1 6, antibodies are exquisitely selective proteins that 
can bind to a single target among millions of irrelevant 
sites. Researchers have long dreamed of harnessing the 
specificity of antibodies for a variety of uses that re- 
quire the targeting of drugs and other treatments to 
particular sites in the body. It is this use of antibodies 
as targeting devices that led to the concept of the 
"magic bullet/ 1 a treatment that could effectively seek 
and destroy tumor ceils and infectious agents wherever 
they resided. 

The major limitation in the therapeutic use of an- 
tibodies is producing a useful antibody m large quan- 
tities. .Initially, researchers screened myelomas, which 
are antibody-secreting tumors, for the production of 



FIGURE 23-4 

Production of tissue plasminogen, activator (tPA) fay mam- 
malian cell culture. The cloned cDNA for human tPA was 
ligated mto an expression vector that contained a strong 
promoter and terminator. The vector was stably transferred 
into a mammalian ceil line. The initial transform ants 
secreted tPA into the culture medium, but the level of 
expression was very low.. Cell lines that expressed tPA to 
high levels were obtained using methotrexate treatment, 
which selects for cells that have amplified the dhfr gene ■ 
resident in the vector together with the linked tPA expres- 
sion cassette (Chapter • 12). High-expressing lines are grown 
in large fermentors and recombinant tPA is purified from, 
the culture medium. 
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FIGURE 23-5 

Production of. a monoclonal a nobody (MAbl.. A mouse is 
inoculated with an antigen (Acn of interest. This stimulates 
the proliferation of lymphocytes expressing antibodies 
against the antigen... Lymphocytes are taken from the spleen 
and fused to myeloma cells by treatment with polyethylene' 
glycol. Hybrid cells are selected by growth in HAT medium 
(Chapter 12), The myeloma cells lack, the enzyme HPRT 
and thus die in this medium unless they become fused with 
a lymphocyte, which expresses the missing enzyme. Un fused 
lymphocyte cells soon die off as well, because they do not 
f?row for lonff in culture. Individual hybrid cells are 
transferred to the wells of a microuter dish and cultured 
for several days. Aiiquots of the culture fluids ate removed 
and tested for the presence or antibody (Ah) that binds the 
antigen. Cells that test positive are cultured for monoclonal 
antibody production. Antibody -producing cell lines are 
stored frozen in liquid nitrogen (this process is called cell 
hanking}., Aiiquots can be thawed out and cultured as needed. 



is inoculated with the antigen to which an antibody 
is desired. After the animal mounts an immune re- 
sponse to the antigen, its spleen, which houses anti- 
body-producing cells (lymphocytes), is removed, and 
the spleen cells are fused en masse to a specialized 
myeloma cell line that no longer produces an antibody 
of its own. The resulting fused cells, or hybridomas, 
retain properties of both parents. They grow contin- 
uously and rapidly in culture like the myeloma cell, 
yet they produce antibodies specified by the lympho- 
cyte from the immunized animal Hundreds of hy- 
bridomas can be produced from a single fusion 
experiment, and they are systematically screened to 
identify those producing large amounts of a desired 
antibody. Once identified, this antibody is available in 
limitless quantities. Monoclonal antibodies are already 
widely used for the diagnosis of infections and cancer 
and for the imaging of tumors for radiotherapy. And 
investigations into their use in the direct treatment of 
cancer, inflammation, and immune disorders is on the 
rise. 



useful antibodies. But they lacked a means to program 
a myeloma to produce an antibody to their specifi- 
cations. This situation changed dramatically with the 
development of monoclonal antibody technology. The 
procedure for producing monoclonal antibodies, or 
MAbs, is shown in Figure 23.-5. First a mouse or rat 



Human Antibodies That Recognize 
Specific Antigens Can Be Directly 
Cloned and Selected 

One new application of monoclonal antibody tech- 
nology is the generation of abzymes, antibodies that 
behave like enzymes to catalyze a chemical reaction. 



462 



C H A P T £ R 



FIGURE 2.3-6 

Direct cloning of antibody cDNAs by PGR. To engineer sn 
antibody, the amino acid sequence of the variable domain 
needs, to be determined. This could be done by sequencing 
a purified preparation of che heavy- (H) and light- (L) chain 
proteins, but a simpler method is to deduce the sequence 
from the cloned cDNA. In the past, a cDN'A library was 
prepared from hybridoma rnRNA and screened with probes 
from the constant regions of the H and L chain genes, A 
simpler method has been developed that uses the. PGR. 
From a comparison of a large number of antibody 
sequences, ammo acids frequently found at the amino 
termini of antibodies were identified. From rhis information, 
a set of degenerate PGR primers was designed that cor- 
respond to all the possible sequences in this region. Because 
the amino acids in the constant domains of different anti - 
bodies are aearly identical, only one PGR. primer is needed 
for che 3' end of each H and L chain sequence. To directly 
clone the antibody cDNAs, cDNA is prepared by treating- 
hybridoma rnRNA with reverse transcriptase, mixed with a 
pair of PGR primers (in. this case, for amplifying 1 the 
heavy chain sequences), and subjected to PCR. Without 
knowledge of the amino ter minus of the antibody chain, a 
PCR had to be set up with each of the different 5' primers 
until an amplified DNA fragment was obtained. The pro- 
cess can be simplified if the sequence of the first six or 
seven amino acids of the antibody can be determined; this 
is -sufficient to design a single 5 f PCR primer. 
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Enzymes catalyze reactions by stabilizing a chemical 
structure intermediate between the substrate and 
product, termed the transition state. Thus, if monoclonal 
antibodies could be made to a transition state ana- 
logue—a molecule resembling the transition state of 
a chemical reaction -then some of these antibodies 
might have catalytic activity. The ability to produce, 
custom-designed catalysts would be very valuable, es- 
pecially to the chemical and pharmaceuticalindustries. 

Initial attempts to produce catalytic antibodies in- 
dicated that they were exceedingly rare and often not 
found among the hybridomas produced by conven- 
tional monoclonal antibody technology. An excellent 
fusion might produce several hundred different an- 
tibodies, but the entire repertoire of antibodies that 
can be produced by the immune system is perhaps 
100 million. How can the entire repertoire be tapped? 

One strategy that shows proxuise is to bypass the 
inefficient fusion step in hybridoma production and 
directly clone antibody cDNAs from die lymphocytes 
of immunized mice (Figures 23-6 and 2 3-7). Inves- 
tigators inoculated a mouse with an antigen. They 




Cloned heavy chai 



recovered spleen cells from the mouse and used PCR 
to amplify millions of cDNAs for antibody light and 
heavy chains. The light- and heavy-chain cDNAs were 
cloned separately into phage vectors and then recom- 
bined in vitro to generate a third, combinatorial library 
of phage carrying random pairs of light and heavy 
chains. The library was plated onto a bacterial lawn, 
and the resulting phage plaques, each containing a 
unique antibody ? were screened with radioactively la- 
beled antigen in a manner similar to that used for 



Recombinant: DNA in Medicine and Industr 



■d.A 



) D 



Lymphocytes from 
immunized mouse 



m 



RNA 



Reverse franscnotase 



DNA 



Amplified 
heavy chain ■< 
cDNAs 



PCR 



PCR. 




Amplifi ed 
light chain 
cDNAs 



Ligate H and L. chain cDNAs 
into A. expression vector 



H cham cDNA ^ L chain cONA 

Package into phage 





CO to 




J Combinatorial 
phage library 



infect E. colt 

H and L chains are 

produced in infected cells 




Prepare replica Filter 



Add labeled antigen 
to Filter 



•~7 



■d in 



1r ' are removed from an 



syritne- 



FIGURE 2 3 

C rearing a combinatorial library of antibodies exoress> 
£ ^/r. Lymphocytes or spleen cell 
immunized animal... mRNA is ob rained and cDNA 
sized with reverse transcriptase. The heavy- (H) and light-- 
(L) chain genes are separately amplified by PCR, as shown, 
in Figure 23-6, and h gated into k cloning vectors. Two dif- 
ferent libraries are produced., one containing the H chain 
genes and one containing rhe L chain genes (this step has 
been omitted from the figure for simplicity). Phage DNA is 
isolated from each library, and the H and L chain sequences 
axe ligaced together and packaged to form a combinatorial 
library. Each phage now contains a random pair of H and 
L chain cDNAs and thus upon infection of E. colt directs 
the expression of the two antibody .chains in infected cells. 
Since the H chain sequence contains only the variable 
region and the first constant domain, the antibody that 
forms is called a Fab, for antigen binding fragment k binds 
the antigen much like an intact antibody but it lacks the 
effector domain. To identify an antibody that recognizes the 
antigen, the phage library is plated, and the antibody (Fab) 
molecules present in the plaques are transferred to filters. 
The filters are incubated with radioacrively labeled antigen 
and then washed to remove excess unbound ligand. A 
radioactive spot on the autoradiogram identifies a plaque 
that contains an antibody that binds the antigen. A recent 
procedure uses the phage display technology, described in 
Figure 23-L0 7 to select antibodies with desired properties. 
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cloning cDixAs from an expression library (Figure 
7-10). (Jut of a million phage plaques screened, 200 
clones were identified that produced an antibody bind- 
ing the antigen. Thus, with this approach, investigators 
were able to sample a million -possible antibodies — 
at least a thousand times more than they could screen 
by ; conventional monoclonal antibody technology. 
Since phages in a particular plaque encode the anti - 
body expressed in the plaque, it is a trivial matter to 
clone the heavy- and light- chain c DMAs from the 
phage DNA. These cDNAs can. be placed into bac- 
terial or mammalian expression vectors for production 
of large quantities of the selected antibody. 

A recent modification of this method uses hi a men - 
tons phages such as Ml 3 instead of X phage and allows 
display of the antibodies on the phage surface. This 
offers the advantage of being able to screen thousands 
more phage (because the screening can be done in 
solution) and to select phage that express tight-binding 
antibodies- We will discuss this method later and in 
Figure 23-10. 
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Humanized" Monoclonal Antibodies 
Retain Activity But Lose 
Irnmunogenicity 

Although swift progress is being made in the identi- 
fication of monoclonal antibodies with potential ther- 
apeutic value, their use is limited by a problem we 
have already discussed in this chapter. Monoclonal 
antibodies are usually mouse proteins, and they are 
not identical to human antibodies. Thus, antibodies 
injected into a patient will eventually be recognized 
as foreign proteins and will be cleared from the 
circulation. 

As we learned in Chapter- 16, both chains of the 
antibody molecule can be divided into variable and 
constant regions. The variable regions differ in se- 
quence from one antibody to another, and this' is the 
region of the protein that binds the antigen. The con- 
stant region is die same among all antibodies of the 
same type. The first method used to reduce the irn- 
munogenicity of a mouse monoclonal antibody was 
simply to construct chimeric genes that encoded pro- 
teins in which the variable regions from the mouse 
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FIGURE 2 3 8 

Antibody engineering. The bask structure of a mouse mono- 
clonal antibody (MAb) resembles that of a human antibody. 
However, there are numerous deferences between amino 
acid sequences of the antibodies from the two species„ 
These sequence- differences account for the irnmunogenicity 
of mouse MAbs m humans. A chimeric MAb is constructed 
by figatmg the cDNA fragment encoding the mouse V L and 
V H domains to fragments encoding the C domains from a 
human antibody. Because the C domains do not contribute 
to antigen binding, the chimeric antibody will retain the 
same antigen specificity as the original mouse MAb but will 
be closer to human antibodies in sequence. Chimeric MAbs 
still contains some mouse sequences, however, and may still 
he immunogenic. A humanized MAb contains only those 
mouse amino acids necessary to recognize the antigen. 
This product is constructed by building into a human 
antibody the amino acids from the mouse complementarity 
determining regions or CDRs. 



antibody were fused to the constant regions from a 
human antibody. The chimeric antibody (Figure 
23-8) retained its binding specificity but more closely 
resembled a natural human antibody. 

This antibody, however, was not fully humanized, 
because it retained amino acid sequences from the 
mouse protein. Thus, scientists have set out to engi- 
neer fully humanized monoclonal antibodies that will 
be indistinguishable from natural molecules. Extensive 
studies of the three-dimensional structures of antibody 
molecules tell us that only a few of the one hundred 
amino acids in the variable region of an antibody 
actually contact the antigen; these regions of contact 
are referred to as complementarity determining regions 
(CDRs). Three CDRs each comprise the antigen- 
binding sites on the light and heavy chains- The rest 
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of the variable region serves as a scaffold to anchor 
i he CDRs in the correct positions. This breakdown, 
of amino acids in the variable region into those serving 
recognition and those serving structural roles is also 
evident from simply comparing thesequenc.es of many- 
antibody molecules- Amino acid sequences in the 
CDRs are hypervariable, whereas the structural, or 
framework, amino acids differ little. 

Thus, to make a fully humanized antibody, ail that 
would be required in principle would be to use in 
vitro mutagenesis to transfer the CDR amino acid, 
sequences from a mouse MAb to a natural human 
antibody (Figure 2 3-8). Tins method was used to hu- 
manize an antibody that recognizes an antigen on the 
surface of human lymphocytes. This humanized MAb 
is now m clinical trials as an immunosuppressant and 
for treatment of lymphoid tumors. Another potentially 
valuable MAb binds a growth factor receptor found 
in large numbers on the surface of many breast tumor 
cells. Laboratory experiments showed that this anti- 
body could block the growth of these ceils in culture 
and caused tumors seeded in mice to regress. Unfor- 
tunately, the first hum anized versions of this antibody 
bound the receptor protein but failed to block the 
growth of breast carcinoma cells. Investigators sus- 
pected that the problem was with the framework amino 
acids, and they used computer modeling to design 
amino acid substitutions that would strengthen the 
antibody- antigen interaction. Several such variant an- 
tibodies were produced and tested; one bound the 
receptor 250 times more tightly than did the original 
antibody and successfully blocked tumor cell growth 
in culture. This antibody is now being produced in 
large quantities for clinical trials. 



Protein Engineering Can Tailor 
Antibodies for Specific Applications 

Humanizing monoclonal antibodies is an example of 
the emerging technology of protein engineering, that is, 
a process using recombinant DNA to modify the struc- 
ture of natural proteins to improve or change their 
function. Antibodies are particularly attractive can- 
didates for protein engineering, because their structure 



is understood in ore at detail and because their poten- 

O J. 

rial for use in medicine is enormous. Another way in 
which antibodies are being engineered is by changing 
their effector domains, the regions of the. heavy chain 
that specify antibody function — for example, killing 
of cells marked by the antibody; In this way, the mode 
of action of a monoclonal antibody can be repro- 
g-rammed. One promising strategy is to replace the 
effector domain entirely with a sequence encodings 
toxin. An antibody-toxin fusion protein would deliver 
the toxin specifically to cells bearing the target antigen. 
This product could be an exceptionally potent treat- 
ment for cancer and for viral diseases such as AIDS. 
Antibody engineering is also being used to construct 
bi specific antibodies.. In these antibodies, each of the two 
arms recognizes a different antigen, thus allowing an 
antibody to bridge the two antigens. For example, a 
bispecifte antibody could recognize a tumor cell pro- 
tein with one arm and a protein on the surface of a 
killer T ceil with the other, thereby bringing the killer 
cells directly to the tumor (Figure 23-9). 



Protein Engineering Is Used to 
Improve a Detergent Enzyme 

Subtilisin is a serine protease produced by bacteria. 
Due to its broad specificity for proteins that commonly 
soil clothing, this enzyme was developed for com- 
mercial use in laundry detergents. (It is subtilisin that 
is prominently advertised as the enzyme additive in 
modern detergents.) But the first detergents containing 
subtilisin suffered from a serious drawback: they could 
not be used with bleach, because bleach inactivates 
the enzyme. Biochemical analysis determined, drat loss 
of activity was due to the oxidation of a methionine 
at position 222. Once this happened, the modified 
enzyme lost 90 percent of its activity. Because they 
knew which amino acid was bleach sensitive, however, 
scientists decided to see whether a variant of subtilisin 
could be produced that was no longer sensitive to 
bleach. 

To do this, site directed mutants were constructed 
in the gene encoding subtilisin. The strategy was sim- 
ply to substitute, one at a time, each of the non— wild- 
type amino acids at residue 222. The mutant genes 




FIGURE 2 3-9 

A bispecifk antibody. By using recombinant DNA, the 
cDNAs for antibodies to two different antigens can be engi- 
neered to make an antibody in" which each arm recognises a 
t different antigen. Thus it is possible to re combine antibodies 
to 'surface antigen on tumor cells and to a protein on cyto- 
toxic T ceils to make a bispecinc antibody that brings the 
two cells together to facilitate killing of the tumor cells. 



were cloned into expression vectors arid the 19 dif- 
ferent subtilisin derivatives were expressed. Biochem- 
ical analysis showed that the cysteine- 2.2 2 enzyme was 



even more active, than the wild-type protein, but a 
was also inactivated by bleach. The next most active 
variant was the alanme-substiruted enzyme, which was 
53 percent as active as wild-type subrijisin. This vari- 
ant exhibited no detectable bleach sensitivity, so de- 
tergents containing this engineered subtilisin. can now 
be used with bleach. This new variant of subtilisin is 
an example of a second-generation molecule, a molecule 
specifically engineered for a new desirable trait. Pro- 
tein engineers are currently at work on a third-gen- 
eration molecule that exhibits decreased temperature 
sensitivity so that it can be used in hot water. 

This e xperiment points out the power of recom- 
binant DNA as a tool for the engineering of natural 
products. Changing the properties of a proteintvasTll 
but impossible -prior to the deveTojjim^ 
binant DNA techniques. Now it is not only possible, 
but easyTTFTs a routine exercise for pr oTeiri engineers 
to generate hundreds of variants of a natural " pnJonn 
foTTestingV "THese* crnlngeTTm 

Eased oiTaetailed knowledge of the structure of "a 

protein; afternauvH}^ 

iTpnirel^ 

section, a combination of structural iiaForiaa^ 
rancTom mutagenesis' arid' a po wernalleleH 
proverl protein function can ha ve 'oh^nar icTe^^ 



Growth Hormone Variants 
with Improved Binding Are 
Selected by Phage Display 

To engineer an improved subtilisin enzyme, research- 
ers were aided by the knowledge that only one specific 
ammo acid had to be changed Thus, they could sys- 
tematically vary that amino acid to find the one that 
worked the best. But more complex challenges face 
protein engineers. Is it possible, for/ example, to en 
gineer" antibodies with higher affinity for antigen- to 
design an inhibitor that tightly binds to and blocks a 
cell-surface protein or an enzyme insrde a cell; to 
.generate a growth factor or hormone with increased 
affinity for its receptor? Alterations of this sorr require 
several amino acid changes, and with 20 possible amino 
acids at each position, the number of variants that 
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need to be screened is enormous (for changes at just 
3 amino acids, there axe 8000 different combinations; 
for 10 amino acids, 10 n different proteins are possible). 
Clearly, these variants cannot be made and tested one 
at a time, and a method for direct selection of improved 
proteins is needed. 

Researchers have used a new approach to select 
variants of human growth hormone with increased 
affinity for growth hormone receptor (Figure 23-10). 



FIGURE 2 3-10 

Expression of proteins and pepaces on che su rf ace or 
filamentous phage, A library of randomly mu tared hGH 

cDNAs was U gated into an M 1 3 based phage in. id vector so 

that hGH was fused to die cat'boxy-terminal domain of die 
Ml 3 gene III protein. The ca.rboxyterau.nus of che gene III 
protein associates with the phage particle, and the ammo 
terrain us s con taming the hGH variants, is displayed on the 
outer surface of the phage. The. library of ph.agem.ids is 
Introduced into E. coli t and ampiciUin.~~resi.sunx colonies are 
obtained.. These E. cok are then infected with a helper phage 

that induces the production of phagemid particles. Only 1- 

• 10 percent of the phage particles contain an hGH -gene III 
fusion protein, and these contain only one hGH fusion 
molecule per phage. This ensures that the phage retain 
sufficient wild-type gene III protein in their coats to 
remain infectious, hGH-phage were passed through a 
column containing the hGH receptor covalendy Linked to 
plastic beads. Only the phage expressing hGH were re- 
tained. The nonbinding phage lacking hGH passed through 
the column. The bound phage were isolated, culm red in. 
E, coli, and ^passed, again over the column. Repeated rounds 
of selection resulted in the identification of hG H variants 
that bound the receptor with exceptionally high affinity. 



From structural studies and extensive mutagenesis of 
hGH, they knew what portions of the amino acid 
sequence were important for receptor binding- They 
synthesized degenerate oligonucleotides that encoded 
all possible amino acids at these positions and hgated 
the pool of oligonucleotides in place of the natural 
■hGH sequence. The resulting pool of variant hGH 
cDNAs was fused to the reading frame of gene 10 in 
the filamentous phage Ml 3. Gene III encodes a minor 
phage coat protein expressed on the surface of che 
phage, and incorporation of the hGH cDNA into this 
gene results in the display of the hGH variants on the 
surface of the phage, one variant per phage. This tech- 
nique is known as phage display. 

Now it was a simple matter to pass this library of 
more than 10 n different phage over a column con- 
taining the hGH receptor. Phage displaying weakly 
binding hGH variants were washed off the column, 
and phage displaying rightly binding variants were 
recovered with a more stringent wash. This population 
of tight-binding phage was amplified by infection of 
E. cbli and passed over the column a second rime. The 
selection was repeated, for a. total of six rounds, each 
round enriching for the phage displaying hGH variants 
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with highest affinity for the receptor bound to rhe 
column. At this point, individual phage were cloned, 
the affinities of their hGH variants were measured 
directly, and the sequences of the hGH cDNAs were 
examined. Among these variants was one that hound 
its receptor about 10 times more tightly than natural 
hGH did. When selected amino acids from another 
region of hGH that had been randomized were in - 
troduced into this variant, the resulting hGH molecule 
bound to the hGH receptor over 50 tunes more tightly 
than the wild-type hGH did. This process is being 
repeated m the hope to obtain even more tightly bind- 
ing variants. 

The ability afforded by techniques such as phage 
display to correlate protein structure and function in 
a systematic way makes possible new methods of find- 
ing novel drugs. If researchers have a good idea what 
combination of amino acids gives the best fit to the 
binding site on a receptor, the next step in rational 
drug design would be to design, or even select, a small 
peptide that binds as well as the larger protein. And 
then, using computer modeling to display the molec- . 
ular contacts between hgand and receptor, researchers 
can attempt to design and synthesize small nonprotein 
molecules that make the same contacts. The end- 
product would be a small organic molecule that could 
be produced more cheaply than a recombinant protein, 
yet would retain the full biological activity of the 
protein hormone. And, more important, such mole- 
cules could be administered orally, thus eliminating 
the major disadvantage of most recombinant protein 
therapeutics— that they must be delivered directly 
into the bloodstream by injection. This type of rational 
drug design contrasts sharply with the conventional 
approach to drug discovery now m use in the phar- 
maceutical industry, in which an inventory of com- 
pletely unrelated compounds is tested at random until 
an active compound is found. 
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.New Techno looies Promise New 
Approaches to Drug Design 

The biotechnology industry is in its infancy, and us 
successes to date folio w r directly from developments 
in molecular biology that are already nearly two de- 
cades old. The recombinant drugs currently in clinical 
use arise from what is by now conventional technol- 
ogy—gene cloning, expression, and mutagenesis to 
improve protein function. These methods will con- 
tinue to turn out new drugs such as erythropoietins 
to treat anemia caused by kidney disease, DNase to 
treat cystic fibrosis, or colony-stimulating factors 
(CSFs) to increase white blood cell production, durmg 
chemotherapy. 

But the true promise of biotechnology is in novel 
technologies that are only now being developed. We 
have mentioned efforts to design catalytic antibodies 
that can accelerate chemical reactions in both medical 
and industrial applications. This is but one example 
of a whole new approach to protein engineering in 
which novel activities can be placed on unrelated pro- 
tein scaffolds, using random mutagenesis coupled with 
selection methods like phage display. Similar goals 
may be achieved by the engineering of ribvzymes, RNA 
molecules with catalytic activity, and the use of the 
polymerase chain reaction to select nucleic acid mol- 
ecules that bind tightly to targets of medical impor- 
tance. Another strategy that may see widespread 
application is treatment with antisense DNA and RNA 
to inhibit the expression of oncogenes in tumors or of 
viral genes in infected patients. And a variety of new 
technologies based on viral vectors promise new ap- 
proaches for vaccines and gene therapy. 

Many of these techniques now work in the test 
tube, and the principal challenge facing biotechnology 
companies is to turn these laboratory techniques into 
commercially viable processes. 
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